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ABSTRACT 

We describe the Carnegie-Spitzer-IMACS (CSI) Survey, a wide-field, near-IR selected spectre-pho- 
tometric redshift survey with the Inamori Magellan Areal Camera and Spectrograph (IMACS) on 
Magellan-Baade. By defining a flux-limited sample of galaxies in Spitzer IRAC 3.6/im imaging of 
SWIRE fields, the CSI Survey efficiently traces the stellar mass of average galaxies to z ~ 1.5. This 
first paper provides an overview of the survey selection, observations, processing of the photometry 
and spectrophotometry. We also describe the processing of the data: new methods of fitting synthetic 
templates of spectral energy distributions are used to derive redshifts, stellar masses, emission line 
luminosities, and coarse information on recent star-formation. Our unique methodology for analyzing 
low-dispersion spectra taken with multilayer prisms in IMACS, combined with panchromatic photom- 
etry from the ultraviolet to the IR, has yielded 37,000 high quality redshifts in our first 5.3 degs 2 of the 
SWIRE XMM-LSS field. We use three different approaches to estimate our redshift errors and find 
robust agreement. Over the full range of 3.6/im fluxes of our selection, we find typical uncertainties 
of cr z /(l + z) < 0.015. In comparisons with previously published VVDS redshifts, for example, we 
find a scatter of cr z /(l + z) = 0.012 for galaxies at 0.8 < z < 1.2. For galaxies brighter and fainter 
than i = 23 mag, we find cr z /(l + z) = 0.009 and o z j(\ + z) = 0.025, respectively. Notably, our 
low-dispersion spectroscopy and analysis yields comparable redshift uncertainties and success rates 
for both red and blue galaxies, largely eliminating color-based systematics that can seriously bias 
observed dependencies of galaxy evolution on environment. 

Subject headings: galaxies: evolution — galaxies: high-redshift — galaxies: stellar content — infrared: 
galaxies 



1. INTRODUCTION 

Understanding the evolution of galaxies and large scale 
structure remains a fundamental challenge in astro- 
physics. Many ambitious galaxy surveys have been car- 
ried out to address this problem, but limited time on 
large telescopes results in a classic problem: sky cover- 
age, depth, and spectral resolution - choose two. For 
example, the very- wide- area Sloan Digital Sky Survey 
(SDSS) provides a wealth of spectral information for 
galaxies over a cosmologically significant volume, but its 
modest depth limits the SDSS to the relatively local, 
'modern' universe. Conversely, the Hubble Ultra-Deep 
Field photometric survey probes deep into cosmic time 
- to the epoch of reionization, but only over a tiny vol- 
ume whose small population of galaxies leads to large un- 
certainties on their physical properties. While the union 
of such surveys has begun to paint a coherent picture of 
galaxy growth, significant patches of blank canvas limit 
our ability to fully describe how environmental processes, 
the infall of gas that fuels star formation, galaxy mergers 
and acquisitions, and feedback, all shape the evolution of 
galaxies over cosmic time. 
The Carnegie-Spitzer-IMACS Survey (CSI) has been 

1 This paper includes data gathered with the 6.5 meter Mag- 
ellan Telescopes located at Las Campanas Observatory, Chile. 

2 Visiting Astronomer, Kitt Peak National Observatory, Na- 
tional Optical Astronomy Observatory, which is operated by the 
Association of Universities for Research in Astronomy (AURA) 
under cooperative agreement with the National Science Founda- 
tion. 



designed to address one of the most dramatic and least 
understood features of galaxy evolution — the remark- 
ably rapid decline in cosmic star formation since z ~ 1.5. 
It is during this extended epoch of galaxy maturation 
that galaxy groups and clusters have also emerged as a 
conspicuous feature of the landscape. The CSI Survey is 
uniquely able to link together the evolution of individ- 
ual galaxies with these features of large-scale structure 
growth. 

In our ambitious spectrophotometric redshift survey of 
distant galaxies, we strike a balance between the afore- 
mentioned three factors: (1) high completeness to mod- 
erate redshift [z < 1.5), (2) spectral resolution intermedi- 
ate between conventional photometric and spectroscopic 
surveys (combining the efficiency of imaging surveys with 
a spectral resolution high enough to resolve large-scale 
structure and prominent emission-lines); and (3) an un- 
precedented area of 15 deg 2 for a z > 1 survey. This gives 
the CSI Survey a volume comparable to the SDSS, with a 
selection method that efficiently traces stellar mass over 
2/3 the age of the universe (0.4 < z < 1.5) — span- 
ning the critical redshift range where cosmic star forma- 
tion precipitously drops, and groups and clusters become 
prominent. As the redshift survey with the largest unbi- 
ased volume at z = 1, CSI will allow us to comprehen- 
sively address the interplay between environment, galaxy 
mass buildup, and star formation at these redshifts. 

In this, the first paper of the CSI Survey, we describe 
the design of the project, with particular focus on the flux 
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limits of the selection, the data processing and the spec- 
tral energy distribution (SED) fitting with its attendant 
determination of redshifts and redshift errors. We apply 
our methodologies to the first batch of data in 5 degs 2 
of the SWIRE XMM-LSS field, and provide an overview 
of some basic properties of galaxies over the past 9 Gyr, 
with an emphasis on the gains made by selecting galaxies 
by stellar mass instead of by their rest-frame UV light. 

2. THE CARNEGIE-SPITZER-IMACS SURVEY 

2.1. The limitations of optically-selected surveys 

Any effective probe of galaxy assembly must sample a 
wide range of masses in order to trace evolutionary con- 
nections between large and small systems. The build-up 
of massive red sequence galaxies may be driven by merg- 
ers with sub-M* systems, so it is essential to trace evolu- 
tion to masses below M* to mitigate against the differen- 
tial growth of the high- and low-mass populations. This 
trade-off between depth and area noted above has led 
to a dichotomy in redshift surveys. Very deep programs, 
such as the Gemini Deep Survey (GDDS; Abraham et al. 
2004) or the Galaxy Mass Assembly Ult ra-deep Spectro- 
scopic Survey (GMASS; |Halliday et"aLl|2008[ ) could not 
cover enough volume to robustly sample the evolution of 
the high mass population at z < 1, while other surveys 
that have traded depth for area do not reach below M* 
with high fidelity. 

An old stellar population at z = 1 with a stellar mass 
of lO n M corresponds to roughly i — 23 mag and r = 24 
mag in the optical, as shown in Figure [jjtop left). The 
extreme optical faintness of such galaxies results in many 
of them being missed in optically-selected surveys despite 
their relatively hig h masses. For exampl e, the selection 
limits for DEEP2 flWillmer et aT] [2006] ) and PRIMUS 
( |Coil et al.||2010| are shown in this figure by the blue 
and green clashed lines, respectively. At z > 1 these 
programs required galaxies to be extremely massive or 
have their optical light dominated by young stellar pop- 
ulations in order to be fall within the survey selection. 
At best it can be difficult to trace the evolution of the 
most massive galaxies with such surveys. At worst, if 
not properly accounted for, color-dependent selection ef- 
fects introduce biases and systematic effects. Extremely 
deep optical limits can be taken as one valid approach 
to ameliorating this problem, with the side effect of an 
overwhelming number of low-mass star-forming dwarfs 
dominating one's source catalog. 

2.2. The potential of an IRAC-selected survey 

The integrated light of all but the youngest stellar pop- 
ulations are dominated by light from the stellar giant 
branch, cool stars whose light output peaks in the near- 
IR. It has long been recognized (e.g., Wright et al. 1994) 
that this results in a 1.6/im "bump" - a peak in bolo- 
metric luminosity - for galaxies with a wide range of star 
formation histories . Indeed, as shown clear ly in (for ex- 
ample) Figure 1 of i Sorba fc S awicki (2010) , this feature 
is nearly unchanging in |Bruzual & Charlot|(|2003| ) models 
of stellar populations with mean ages 100 lVlyr < r <10 
Gyr. Put in other terms, the near-IR mass-to-light ratio, 
M/Li^m changes slowly for all but the youngest stellar 
populations. 

The CSI Survey exploits the wide field and sensitivity 
of the Spitzer Space Telescope with the Infrared Array 



Camera (IRAC) to take advantage of this property of the 
integrated near-IR light from stellar populations in z ~ 1 
galaxies. Combined with the insensitivity to internal and 
Galactic extinction, selection at 3.6/im closely mirrors 
selection by stellar mass. Figure 1 (top left) directly 
compares the evolution of 3.6/im magnitude with the op- 
tical r and i bands: the CSI selection wavelength has a 
dependence on redshift that is much shallower than sur- 
veys selected in the optical. What slope remains for the 
CSI selection function is the unavoidable k- correction: 
over the redshift range 0.7 < z < 1.5, the center of the 
3.6/im IRAC band corresponds to restframe wavelengths 
of 2.1/im > A c > 1.4/im, that is, straddling the 1.6/im 
"bump" of the spectral energy distribution (SED), but 
shifting through this feature as the redshift changes. 

Even so, massive galaxies exhibit a much flatter trend 
of observed magnitudes with redshift in the IR than in 
the optical, and this weaker dependence on galaxy mass 
afforded by 3.6/im-selection of the sample is a key feature 
of the CSI Survey. Our goal has been to make a spec- 
trophotometric survey to characterize the galaxy pop- 
ulations and environments to z = 1.5, unbiased at all 
redshifts down to a stellar mass of M = 4 x 1O 1O M . As 
can be read from Figure 1, this mass limit corresponds at 
z = 1 to r = 25 mag or i = 24 mag, with equivalent limits 
of r = 26 mag and z = 25 mag to reach this mass limit at 
z = 1.2. Our current spectroscopic reduction and analy- 
sis, described below, is reaching an effective photometric 
limit of r = 26 mag (the dashed orange line). 

Figure 1 (bottom left) plots the limiting mass as a func- 
tion of redshift for CSI (solid orange) and other redshift 
surveys. The depth of CSI in stellar mass is substantially 
less sensitive to redshift compared to the others due to 
the IRAC 3.6/im selection, varying by a factor ~ 3 over 
z = 0.5 — 1.5, compared to 1 — 2 dex over the same red- 
shift range for optically- selected surveys. The result is 
that CSI samples nearly uniformly by stellar mass for 
the full, large volume of the survey, and to a lookback 
time of 9 Gyr. 

This critical point is illustrated in Figure 1 (bottom 
right), plotting the depth in stellar mass against the vol- 
ume probed with complete, unbiased samples. DEEP2, 
shown in violet (the line colors are the same as in the 
bottom left), is limited in both area and mass depth. 
PRIMUS's 9 degs 2 is limited in depth, and thus only 
probes an unbiased volume comparable to our first 5 
degs 2 . When CSI reaches its goal of 15 degs 2 , the sur- 
vey will cover an unbiased volume equal to the SDSS 
with similar depth in stellar mass. The large areas avail- 
able from legacy Spitzer IRAC surveys — both wide and 
deep, and the freedom from foreground and internal ex- 
tinction at this wavelength, allows the construction of 
uniformly deep, homogeneous photometric samples for 
spectroscopic follow-up. 

CSI reaches factors of 2-6 deeper than DEEP2 in mass, 
over an area ultimately 8 times wider, and redshifts suffi- 
ciently accurate to characterize environments by directly 
identifying groups and clusters. With such data we aim 
to make the first group catalog at z = 1 that is sta- 
tistically comparable to SDSS, and thus enable the first 
detailed environmental characterizations of galaxies at a 
time when the universe was less than half its current age. 

In this first paper, we describe the details of our 



CSI Redshift Survey: I. Description and Methodology 



3 




Redshift 



Unbiased Volume (Gpc 3 ) 



Figure 1. (top left) Apparent magnitudes in r, i, ana o.o/im as a mnction 01 reasnin, ior passively evolving sienar popu lations {a ^ Zf 
with stellar m asses 5M* and M */2 (logM* ~ 10.85 at z < l; |Drory et al.|2009| ). The magnitude limits for DEEP2 ( |Willmer et al.|2006) 
and PRIMUS ( Coil et al.|2010|) are drawn. Such optical flux limits only cover the most massive passively evolving systems or those galaxies 



using SExtractor ( Bertin fc Arnouts| [l996). 



3.1. 



with young unattenuated stellar populations, biasing galaxy samples at early times. The shallow dependence of the 3.6/im magnitude on 
redshift yields a selection with significantly less bias against old systems, (bottom left) Limiting stellar mass of faint galaxy surveys by 
redshift. By using the IRAC 3.6/im band, the CSI survey (the solid orange line) traces stellar mass more uniformly than s amples selected 
in the optical. Our sample reaches stellar masses equivalent to the present day Milky Way (6.6 x 1O 1O M0; McMillan 2011) out to z = 1.4, 
almost order-of-magnitude lower than DEEP2, and half the present day Milky Way at z = 0.9, a factor ot two deeper than DEEP2 or 
PRIMUS, (bottom right) Volumes probed with complete, unbiased samples for several redshift surveys as functions of limiting stellar mass. 
When the 15 deg 2 is completed, CSI's volume will be more than an order of magnitude larger than DEEP2. The volume traced by the 
first 5.3 degs 2 (this paper) is shown by the dotted line. 

data reduction and redshift fitting, redshifts for the first 
37,000 galaxies in 5.3 degs 2 of the SWIRE-XMM field 
(Figure [2]), and an initial characterization of the general 
galaxy populations in our stellar mass-limited sample. 

3. DATA 

A project of this type and scope requires attention to 
detail and care in the processing of a broad range of data 
from different sources. In this section we describe the 
imaging data and reductions that underpin the broad- 
band flux measurements for the sample, both for the 
ultimate goal of fitting SEDs, but also for defining the 
selection criteria and incompleteness functions. Unless 
otherwise specified, all object detection was performed 



Photometry 

The SWIRE Legacy Survey ( [Lonsdale et"aT|2QQ3| > ob- 



served several large fields with IRAC to a depth suitable 
for our survey. Three of these fields are accessible from 
southern telescopes, providing up to 23.8 deg 2 of cover- 
age at 3.6/im. Basic properties of the three fields are 
given in Table M Of these fields, 15.3 deg 2 have sup- 
plemental optical imaging that is publicly available. Of 
the 9.1 degs 2 of IRAC imaging in the XMM-LSS field, 
6.9 degs 2 is covered by the CFHT Legacy Survey Wl 
ugriz imaging. In this section we discuss our analysis 
of the SWIRE XMM-LSS IRAC data, our reprocessing 
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Figure 2. A schematic of observations in the first 5 degs 2 of the SWIRE-XMM field studied by the Carnegie-Spitzer-IMACS survey. 
The SWIRE IRAC imaging field is outlined in red. The CFHTLS-W1 optical data reanalyzed by us is shown in green. The first set of 
NEWFIRM J and K s observations are marked in violet. The positions of the IMACS slitmasks are outlined in blue, with the blue points 
marking the positions of the 37,000 galaxies with CSI redshifts. The field of view of IMACS is comparable to the size of the full moon, 
illustrated in the lower right. 

of the CFHTLSW1 ugriz imaging and subsequent pho- 
tometry of the 3.6/mi catalog, as well as our observations 
at J and K s of the field using NEWFIRM flAutry et aL 
20031) on the Mayall 4m telescope at Kitt Peak National 



Table 1 

Basic Properties of the Southern SWIRE Fields 
Targeted by CSI 



O bservatory. 

3.1.1. Spitzer-IRAC Imaging 

The SWIRE Legacy Survey was a program undertaken 
to trace galaxies by stellar mass back to z = 2 ( |Lons-| 
|dale et al.|[2QQ3| ). The IRAC images of the XMM-LSS 
field were obtained from the archive at IPAC. The re- 
ductions and proces sing of these data were described by 
( |2005[ ). In order to minimize contamina- 
ted; catalog by artifacts around stars, we 
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configured SExtractor so as to ignore elliptical regions 
around bright stars. A 'mexican hat' convolution kernel 
was used for object detection, resulting in 585,159 ob- 
jects in the 3.6/im catalog of the XMM-LSS field. Fluxes 
were m easured in a manner described by Sur ace et al.| 
(2005). Down to our selection limit of 3.6 ab = 21 mag 
the 3.6/im catalog contained 266,621 objects. 



3.1.2. 



The Optical Imaging 

The CFHT Legacy Survey flCuillandre fc Bertin|2006 ) 
targeted multiple fields using the one-degree MegaCam 
imager around the sky with a broad range of scientific 
goals. The "Wide" survey focused on several astrophysi- 
cal questions, with the Wl field providing almost 7 deg 2 
of overlapping ugriz coverage in the SWIRE XMM-LSS 
field. Unfortunately, the processed images and catalogs 
available at Terapix were not entirely suitable for our 



purposes, owing to uncertain astrometry and improper 
defringing of the i and z data. Therefore, we obtained 
the complete set of calibrated frames from the CFHT 
archive and processed the data using the following addi- 
tional steps. First, we constructed new fringe frames in 
i and z by medianing scaled exposures obtained within 
a single night. These were then rescaled and subtracted 
from the individual exposures. Some bright objects did 
bias these medians when the number of exposures was 
small (i.e. 7V<10), leaving faint traces in the resulting 
fringe frames. In general, however, the greater sky uni- 
formity after subtraction of these new fringe frames im- 
proved the depth of our resulting catalogs by ~ 0.2 to 0.5 
mag compared to the on-line catalogs. We constructed 
sky frames as well for the u, g, and r bands using a simi- 
lar methodology, though not restricting the construction 
to data collected within single nights. 

New astrometric solutions were derived for each ex- 
posure using the IRAC catalog as a set of deep astro- 
metric standards. Cubic solutions for each chip were 
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derived with a typical RMS scatter of 0"15. For those 
regions of the MegaCam images that did not overlap 
the SWIRE data, we supplem ented the catalog with the 
2MASS point source catalog ( |Skrutskie et al. 2006), in 
order to prevent the solutions from diverging at the edges 
of the IRAC field. 

Using these new astrometric solutions, and the pho- 
tometric solutions from the headers of the individual 
frames, we constructed cosmic-ray-cleaned image stacks 
with 0"185 pixels, 30 arcmin on a side and evenly spaced. 
These smaller images were more easily managed than the 
larger image formats provided by Terapix, which allowed 
us to distribute the image analysis and photometry tasks 
over multiple processors. The zeropoints were checked by 
comparing the photometry of moderately bright objects 
with the data in the catalogs from Terapix and we found 
systematic offsets less than ±0.03 mag in every case. Ob- 
ject catalogs were generated using SExtractor, and these 
were matched to the IRAC catalog with a tolerance of 
1"5 arcsec, with the goal of using the optical image char- 
acteristics to aid in star/galaxy separation, and to deter- 
mine which I/L4C-selected objects have centroids with 
optical centers falling far from the defined slit positions. 

3.1.3. The Near-IR Imaging 

NEWFIRM flAutry et al][2003l ) is a wide-field (27' x 
27') near-IR camera deployed by NO AO at the Mayall 
4m at Kitt Peak from 2007 to early 2010. During the fall 
semesters we imaged the XMM-LSS field in J and K s . 
Typical exposure times were 70 min/pixel in J and 32 
min/pixel in K s . The seeing ranged between 0.8 — 1.4 
arcsec (FWHM). 

The data were processed using a fully automated, cus- 
tom pipeline, written as a prototype for a wide-field im- 
ager being deployed at Magellan. The basic steps in the 
reduction were: (1) subtraction of the dark current; (2) 
correction for nonlinearity; (3) division by a flat-field; 
(4) masking of known bad pixels; (5) construction of 
first-pass sky frames; (6) derivation of image shifts, in 
arcsec, on the sky; (7) stacking of the first-pass sky- 
subtracted frames; (8) generation of a deep object cat- 
alog; (9) construction of object and persistence masks 
for second-pass sky estimation; (10) temporary interpo- 
lation over objects, persistence, and bad pixels for each 
frame; (11) constructing bivariate wavelet transforms of 
these masked frames; (12) fitting the time variation of 
the wavelet transforms of the sky using shorter frequen- 
cies for larger spatial scales; (13) reconstruction of model 
sky frames by inverting the temporally smoothed wavelet 
transforms; (14) subtraction of the model sky frames; 

(15) re-derivation of image shifts, in arcsec, on the sky; 

(16) final stacking of the sky-subtracted frames, includ- 
ing the generation of sigma and exposure maps; (17) 
identification of objects in the 2MASS point source cat- 
alog, restricted by 2MASS PSC quality flags; (18) appli- 
cation of rotation, translation, and scale to the camera 
distortion based on the 2MASS objects; and (19) calcu- 
lation of image zero-points using the 2MASS objects. 

Using the zeropoints of each image, we constructed 
image mosaics 30' on a side centered at the locations 
of our ugriz mosaics, but with 0"4 pixels. 

3.2. Aperture Photometry 




Figure 3. Construction of the new "Uniform Dispersion Prism," 
built for CSI. Dr. S. Shectman designed the UDP to have a resolu- 
tion of R ru 25 from 7500 A to lum. The eight layers are made from 
thin prisms of S-FPL51 and N-KZFS2, to make a stack of glass 150 
mm thick. This prism has a resolution with a mild dependence on 
wavelength (see FigureJZl) co mpared to the first prism deployed in 
IMACS (see |Coil et al.[20l0| ). 

Magnitudes in ugriz JK S were derived using aperture 
photometry within a range of circular apertures for the 
entire SWIRE catalog. Because the seeing varied with 
wavelength and pointing, we convolved the z-band im- 
age stacks with Gaussians to simulate the effects of 
poorer seeing in the ugriJK s data. The offsets in mag- 
nitude from the degraded z images were applied as PSF- 
corrections to the ugriJK s aperture magnitudes. While 
the details of the PSF and potential blending are crit- 
ical for small apertures and for modeling the profiles 
of the objects, our choice of fitting the SEDs to much 
larger-aperture (D = 4" ) magnitudes allows us to utilize 
a simple and economical approach to the PSF correc- 
tions. This choice was especially important given the 
enormous size of the object catalog, while reducing the 
systematic errors in matching the aperture magnitudes 
to the IRAC 3.6 jam fluxes. 

3.3. Spectroscopy 

Between 2008 and 2009 the SWIRE XMM field was 
targeted in 31 multislit mask exposures with the IMACS 
f/2 camera configuration and the Low Dispersion Prism 
(LDP) designed by S. Buries for the PRIMUS redshift 
survey flCoil et al.|20To] ). In 2010 we observed 29 SWIRE 
XMM masks using an innovative eight-layer disperser 
called the Uniform Dispersion Prism (UDP, see Figure 
[3]). Although both of these dispersers produce spectra 
with median resolutions of A/AA ~ 30, the dispersion 
curve of the UDP is much flatter, providing significantly 
higher resolution out to 1 /am than the LDP at the ex- 
pense of marginally lower resolution in the blue (Figure 

Previous work by the PI on a similar program had 
obtained adequate S/N ratios with exposure times of 3 
hours down to z' = 23.3 mag using less sensitive SITe 
detectors (Patel et al. 2009ab, 2010, 2011). Significantly 
more sensitive E2V detectors were installed in IMACS, 
thanks to support from the NSF's TSIP program (see 
Dressier et al. 2011); these CCDs boosted the through- 
put of the instrument by factors of 2-3 in the far red. We 
observed with these dete ctors using the Nod and Shuf- 
fle mode of IMACS (see |Glazebrook fc Bland-Hawthorn 
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Figure 4. (Figure reproduced from Dressier et al. 2011) (left) Multi-slit mask des igned for use with the Low Dispersion Prism, (top 
right) Example LDP spectrum — nod & shuttle |Glazebrook & Bland-Hawthorn 200T| produces doubled object and sky spectra. Night sky 
lines [O I]5577A and [O I] 6300 A are marked. Below the LDP spectrum is one obtained with the UDP, with its more uniform dispersion 
(bottom right). The resulting difference in resolution is shown in the bottom right. The resolution provided by the new Uniform Dispersion 
Prism, designed for CSI, provides an improved ability to trace the 4000A break and [OII]3727A to z = 1.4 when coupled with IMACS's 
red sensitive e2v detectors. 

• Identify lines and fit 2D wavelength solutions using 
isolated helium lamp exposures on a chip-by-chip 
basis; 



2001), and accumulated exposure times of 2h per galaxy 
tor more than 90% of the sample, using individual inte- 
grations with durations of 30 min (15 min per position). 
Typical slit lengths were 5 arcsec. The positions of the 
slit masks are shown in Figure [2] 

Spectrophotometric standards were obtained during 
most observing runs, with a consistency of 5% from run- 
to-run. Helium calibration lamps illuminating the de- 
ployable flat-field screen were taken approximately every 
hour (i.e., every other science exposure). In this section 
we describe the basic steps involved in reducing the mul- 
tislit prism data using the custom pipeline th at had been 



written for the earlier cluster programs (e.g. |Patel et al. 
2009a|b[|20TT ). 



3.3.1. Wavelength Calibration 

Two critical aspects of the reduction of low-dispersion 
prism data involve mapping wavelengths on the detector, 
and transposing between sky coordinates of objects and 
CCD coordinates. Too few calibration lamp lines can 
be used to define the solution for a given slitlet, so we 
derive global wavelength mappings over an entire CCD 
using all of the slitlets simultaneously. Thus, lines in a 
single spectrum that may be corrupted by bad columns 
or pixels will not affect the solution for that slitlet, as 
the ~ 3000 helium lines over a CCD frame constrain the 
fit. 

The following are the basic, automated steps we have 
implemented in our pipeline: 

• Derive mapping of sky coordinates to CCD coordi- 
nates using an image of the slitmask; 



• Refit new centroids for blended helium lines using 
nonlinear simultaneous Gaussian fitting on a slit- 
by-slit basis; 

• Fit for improved mapping of 2D wavelength solu- 
tions; 

• Shift the wavelength maps using 2-3 night sky 
emission lines (e.g., [O I] 5577A, Na 5890A, [O I] 
6300 A); 

We start with a distortion map for the camera, defined 
from previous exposures of the field around the globular 
cluster Palomar 5, to compute the approximate mapping 
of sky coordinates to CCD coordinates for the objects in 
the multislit mask. SExtractor is run on a direct image 
of the slit mask and a simple pattern analysis matches 
up the predicted positions of the slits and the measured 
positions of the slits to create an adjusted mapping of 
sky coordinates to CCD coordinates. 

With a theoretical dispersion curve for the prism, we 
generate a trivariate mapping of sky coordinates to CCD 
coordinates that is wavelength dependent. SExtractor is 
run on a helium comparison lamp image and the result- 
ing catalog of the 3,000 to 4,000 helium lines is pattern- 
matched to the predicted positions of 8 unblended helium 
lines (out of the 11 lines in our line list). New wave- 
length mappings are solved for as the order of the fit is 
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gradually increased. The nearly final solutions generated 
at this stage are first order rescalings of the theoretical 
dispersion, with coefficients that are cubic (check!) poly- 
nomials of the sky coordinates. The typical RMS at this 
stage is 0.4 pixels (check). With these solutions, we can 
now generate predicted positions for the complete he- 
lium line list and fit Gaussian profiles to the full line 
list for each slit. Within each slit, the Gaussians are 
fit simultaneously, providing accurate positions for both 
the unblended and blended helium lines. Using these 
new positions for the helium lines in CCD coordinates, 
we derive through iteration, the trivariate mapping be- 
tween CCD coordinates and both wavelength and sky 
positions, with a typical final RMS scatter of 0.25 pix- 
els. For LDP data this scatter corresponds to ~ 50 Aat 
9000 A, or Sz ~ 0.005. This large uncertainty in the 
wavelength calibration in the far red is due to the fact 
that the dispersion reaches ~ 200 A/pixel at 9000 A; the 
scatter of ~ 50 A introduces a 25% uncertainty in A A at 
such wavelengths when computing the flux calibration. 
Because the resolution with the UDP is 3x higher at 
these red wavelengths, this additional source of uncer- 
tainty is negligible for those data. 

The final step involves cross-correlating simulated 
night sky emission lines ([OI] 5577 A, Na I 5890 A, and 
[OI] 6300 A) placed at their predicted positions with the 
spectra in the science frames, on a slit-by-slit basis. The 
median offset in both x and y is then applied to the wave- 
length mappings found from the nearest helium lamp ex- 
posure. 

3.3.2. Extracting Spectra 

Due to the unique nature of these prism data, the sci- 
ence frames must also be processed in a nonstandard 
fashion using custom written routines. We take particu- 
lar care in the extraction of spectra because of the small 
number of pixels covered by each object, the faint opti- 
cal limits we expect to probe, and the proximity of the 
objects to slit ends where residual sky counts can bias 
simple extraction algorithms. Starting with the two 2- 
dimensional spectra (here denoted "A" and "B" ) of each 
galaxy that come from the nod-and-shuffle observations, 
the basic processing steps are as follows: 

• Subtract spectrum A from B, and B from A; 

• Divide by a normalized flat-field, using pixels only 
covered by spectrum A; 

• Optimally extract each object within a given ex- 
posure, adopting a wavelength-dependent Gauss- 
Hermite expansion of the spatial profiles for each 
object, and using the pixels of the A and B spectra 
simultaneously; 

• Solve for the global translation, rotation, and scale 
changes in the object positions due to slitmask mis- 
alignment; 

• Fix the object positions and redo the optimal ex- 
tractions, deriving new Gauss-Hermite moments 
and ID spectra. 

• Combine the data from all exposures of an object 
into a single extracted spectrum. 



A more detailed description of these steps follows. Ini- 
tially we subtract the night sky background spectrum, 
using the "B" spectrum as the sky for the "A" spec- 
trum, and vice versa. Next we divide the results by a 
normalized flat-field using the pixel locations of the "A" 
spectrum for both "A" and "B" spectra (reminder to 
the reader unfamiliar with the details of Nod & Shuffle: 
electrons counted in the "B" pixels originated in the "A" 
pixels). 

Recovering the one-dimensional spectra of the objects 
in the slitlets is now possible, but several additional steps 
and some care are required, because of the proximity 
of the objects to the ends of the slitlets, and given the 
relatively short nods (1.6 arcsec). Our procedur e is a 
modi fication of the standard optimal extraction (Home 



1986) with several key differences. Crucially we can use 
both the "A" and "B" spectra simultaneously to mea- 
sure the spatial profile of a given object. We employ a 
Gauss-Hermite decomposition of the spatial profile, with 
moments that are low-order Legendre polynomial func- 
tions of wavelength. An iterative routine also solves for 
the centroid of the Gaussian as a low-order polynomial 
of wavelength. The second moment (e.g., standard de- 
viation of the Gaussian) is also determined as yet an- 
other low-order function of wavelength. A first guess 
for the one-dimensional spectrum of the object can be 
obtained using this Gaussian approximation for the spa- 
tial profile, by solving for the b-spline (Dierckx 1983) 
that fits the "A" and "B" data, both normalized by 
the wavelength-dependent Gaussian spatial profile. With 
this approximate spectrum, we resolve for the parame- 
ters of the spatial profile, including up to 4 Hermite mo- 
ments (10 for spectrophotometric flux standards), each 
also wavelength-dependent. The wavelength binning of 
each slitlet is preserved, defined by the wavelength solu- 
tion at its location. 

With a first set of extracted spectra and, more impor- 
tantly, object moments, in all eight CCDs of a given ex- 
posure, we analyze the difference between the predicted 
object positions and the detected object positions. De- 
viations are well described by a linear function of coor- 
dinates in the slitmask, largely the result of small mis- 
alignments when the observations were taken. We use 
this new linear function to fix the object positions, re- 
compute spatial profiles, and re-extract object spectra. 

Using the spatial profiles of each observation of a given 
object, we have a final procedure that solves for the 
one-dimensional spectrum using all exposures simulta- 
neously, using the flux calibration (see below) so that 
all data are now in physical units. We fix the first and 
second moments to those determined above, and employ 
a similar optimal extraction routine to that described 
earlier, but one which uses the "A" and "B" spectra of 
all the exposures, recomputing the higher-order Hermite 
moments of the spatial profiles and solving for a b-spline 
that best represents the spectrum of the object in a least- 
squares sense. Pixels are weighted according to the in- 
verse of the expected noise. Bad columns and flagged 
pixels are given zero weight. Iteration allows us to flag 
and reject most cosmic rays. The process performs an 
optimal extraction that yields a single ID spectrum that 
matches all the data in a least-squares sense. Given the 
small differences between the wavelength solutions of the 
exposures, we sample the one-dimensional b-spline rep- 
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resent ation of the spectrum at a fixed set of wavelengths, 
defined by the dispersion of the prism at the center of the 
field. This fixed grid greatly simplifies the construction 
of templates for fitting the SEDs. 

Spectra for objects repeated in overlapping slitmasks 
were not combined at this stage, but kept separate — 
for this paper — so that we could better assess data 
quality and empirically estimate our redshift precision. 
Approximately 10% of the galaxies were repeated; we 
discuss these below in the context of the redshifts and 
their uncertainties. Approximately 20% of the objects 
observed with the LDP were also reobserved with the 
UDP. 

3.3.3. Spectrophotometric Flux Calibration 

Accurate flux calibration of the spectra is a critical 
component of the survey because the relative slope of 
a spectrum provides useful information for the determi- 
nation of redshift. For CSI we choose bright hydrogen 
white dwarf standard stars with deep, broad Balmer lines 
that can be detected with the LDP/UDP. Deep Balmer 
lines are required because we expose these stars in the 
alignment star boxes and the positions of the absorption 
lines allow us to refine the zeropoints of the wavelength 
calibrations for the stars when they are not accurately 
centered in the boxes. 

The processing of standard star exposures is similar to 
that described above, with the following modifications. 
First, to capture essentially all of the photons from the 
star, we place it in the center of an alignment star box 
on one of our slitmasks. By doing so we can infer the 
wavelength calibration for the alignment star box by us- 
ing the trivariate wavelength solution of the helium lamp 
exposure obtained immediately after the standard star. 

In order to derive the sensitivity function, we must 
first compensate for the fact that the seeing in the stan- 
dard star exposure leads to different resolution than one 
would obtain with a narrow slit. We make an assumption 
that the seeing was isotropic and that the Gauss-Hermite 
parameterization of the spatial profile is a valid descrip- 
tor of the seeing profile in the dispersion direction. Us- 
ing this profile as a kernel, we deconvolve the extracted 
spectrum using an implementation of CLEAN (Hogbom 
1974 ), convolving the result to a resolution equivalent to 
that defined by the science data (FWHM=4.5 pixels). 

Due to time and weather constraints, spectrophoto- 
metric flux standards were not obtained during every 
night, but the excellent run-to-run repeatability of the 
relative flux calibration from 8500 A to 4500 A for the 
LDP and from 9000 A to 4500 A for the UDP, of approx- 
imately ±5% allows us to use flux standards obtained on 
different nights or during different runs. 

4. SED FITTING 

With flux calibrated spectra and broadband photom- 
etry in hand, a suitable library of template spectra can 
now be employed to estimate spectrophotometric red- 
shifts. In this section we describe the basis functions in 
our templates, implement a generalized maximum likeli- 
hood method and estimate confidence limits for the red- 
shifts of each galaxy along with several other parameters 
and properties. 

Here we discuss the construction of the templates, us- 
ing several continuum components, each derived from the 



Maraston] (|2005[ ) models. The Kroupa (2001) initial mass 
function (IMF) was used, resulting in a median offset 
from "diet" Salpeter IMF (Bell & de Jong 2001) of 0.04 
dex for SSP ages up to about 9 Gyr. 

4.1. Ingredients 

Ideally, a set of spectral templates should not only span 
the broad range of redshifts a survey may probe, but 
also encompass the potential range of optical and near- 
IR properties of galaxies over those epochs. Satisfying 
this latter constraint cannot be done a priori as it is 
one of the chief goals of the project. However, models of 
evolving stellar populations have been used for modeling 
low-dispersion prism data for the purpose of recovering 
redshifts to z ~ 1 (see Patel et al. 2010). Our method 
for CSI echoes that approach, and we describe it here. 

Our templates are constructed as the superposition 
of several continuum components and multiple emission 
lines. The stellar population bases were derived from the 
Maraston ( 2005[ ) models, using the Kroupa (2001) initial 
mass function (IMF)R In building our templates we ex- 
ploit the well known fact that for times r > 1 Gyr after 
the cessation of star formation, optical flux no longer 
provides significant leverage on a galaxy's prior star for- 
mation history (Tinsley 1974). We adopt a simple set of 
stellar population basis functions: these components are 
sensitive to different timescales of a galaxy's star forma- 
tion history; in combination, they reproduce the broad 
range of SED properties seen in normal galaxies. 

The first base component is a constant star formation 
model with a starting epoch of Zf = 5. The time at 
which star formation ceases for this component is a free 
parameter our analysis, a grid of values ranging from 
10 9 yr to 10 10 yr prior to the redshift of observation 
(capped at z = 5), whose values are spaced logarithmi- 
cally with an interval of 0.10 dex. The metallicity of this 
base population is also gridded with values ranging from 
[Z/H]= -1.2 to [Z/H]= 0.6, with an interval of 0.2 dex. 
Redshift is the final gridded parameter, ranging from 
z = 0.005 to z = 2.0 at intervals of Az = 0.005. Super- 
imposed on the base population are up to five piecewise 
constant SFR populations that formed in 200 Myr inter- 
vals centered on lookback times of t = 1,3,5,7,9 x 10 8 
yr, with the same metallicity as that of the base. Four 
cartoon star formation histories built out of such compo- 
nents are schematized in Figure [5] The SEDs of each of 
these stellar components are redshifted, convolved to the 
wavelength-dependent resolution of the instrument, and 
sampled at the wavelengths appropriate for the galaxy 
spectra. Lastly, these are all replicated with extinctions 
of Ay G {0.0, 0.5, 1.0, 2.0} mag in an effort to reproduce 
the effects of complexity in the geometries of dust atten- 
uation than simple screen models. 

In addition to the continuum components, we include 
several emission line components: (1) a single unre- 
solved Gaussian emission line for [OII]3727 A, (2) a blend 
of three Gaussians at [OIII]5007A, [OIII]4959A, and 
H/34861 A, with ratios of 1 : 1/3 : 1/10, to broadly mimic 
the typical line ratios seen in galaxies at the masses we 
expected (e.g. Kauffmann et al. 2003), (3) a Gaussian 

3 This is equivalent to applying an offset of -0.04 dex to the 
stellar M/L ratios of "diet" Salpeter IMFs (Bell & de Jong 2001) 
for single stellar population (SPP) ages up to about 9 Gyr. 
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Figure 5. Cartoon star formation histories for four representative galaxy types. Because optical passbands provide poor leverage on 
star formation histories earlier than 1 Gyr prior to the epoch of observations, we have reduced the complexity of galaxy star formation 
histories to nonnegative combinations of discrete components such as those illustrated in these cartoons. For our purposes, each galaxy 
is modeled with six age-related components, as described in the text, with the oldest component starting at z = 5 and continuing down 
to some time t en d prior to the epoch of observation. Five younger, discrete components of duration 200 Myr allow for the broad possible 
range of complex histories in a galaxy's recent past. Each of these six components is quadrupled, with four different levels of extinction, 
Ay £ {0, 0.5, 1, 2} mag, leading to a total of 24 stellar components with non-negative contributions to the stellar mass of a galaxy. With 
redshift, metallicity, and t en d as our gridded parameters, there are 24 stellar coefficients at each location in the grid. By abstracting star 
formation histories in this way, we have retained the essential information content of on-going star formation, intermediate-age populations, 
and an old, underlying stellar population. 

colors derived from the broadband photometry and ad- 
justing for any color offset did not significantly affect the 
measured redshifts. 

Standard least-squares and x 2 minimization tech- 
niques are susceptible to bad data points and outliers, for 
example, from a few residual cosmic rays or unidentified 
bad pixels, especially given the small number of usable 
pixels in each spectrum (~ 150). We opted to perform an 
iteratively reweighted non-negative least squares fit for 
the coefficients of the template fit at each location in the 
grid, using the weight fun ction of Huber's M-estimator 
dHuberl 119811 iZhangl 119971. Doing so is equivalent to 
HuE 



for Ha6563A, and (4) an unresolved emission line for 
MgI2799 A. Because of the increasing uncertainties in the 
flux calibration of LDP data beyond 8000 A, no emission 
lines were allowed beyond that point in those fits. Given 
the higher quality of the UDP data in the far red, we ex- 
tended this limit to 9000 A in fitting UDP data, with the 
exception of [Oil] 3727 A, which we accepted to 9400 A. 
These restrictions do not greatly affect the redshifts that 
are measured, but they do have some impact on the fit to 
the red continuua of low redshift galaxies. Note that the 
widths of the Gaussian profiles are fixed to that defined 
by the spectral resolution. 

Thus there are in total 24 stellar continuum compo- 
nents and 4 emission line components, each redshifted, 
convolved to the wavelength-dependent resolution of the 
instrument, and sampled at the wavelengths appropriate 
for the galaxy spectra. The broadband magnitudes of 
each component are computed using the filter transmis- 
sion and detector QE curves of the respective imaging 
instruments. 

4.2. Fitting for the Galaxy Components 

Before solving for the 28 coefficients at every location 
in the three-dimensional grid of redshift, metallicity, and 
termination time, we compute a simple constant rescal- 
ing of the IMACS spectrum to the broadband griz pho- 
tometry from the D = 4 arcsec aperture. In general the 
colors of the IMACS spectra agreed within ±10% of the 



minimizing ttuber's lvl-estimator itself, given as L Hubert 
with the associated weight function Wnuber- 
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1.345. 



Near the optimum location, points 
the behavior of L2 = x 2 /2 (i.e. least squares) 
while outside points retain the behavior of L\ = \x\ (i.e. 
minimum absolute deviation), leading to an estimator 
that is robust against outliers but sensitive to the details 
of the distribution close to the optimum location. In 
three iterations the routine effectively minimized this M- 
estimator, instead of the L2 M-estimator, where L2 = 
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X 2 /2. The result is a set of coefficients that are robust 
against any bad data present, as long as the fraction of 
bad pixels is < 25%. 

With the rescaled IMACS spectra and the ugrizJK s 
photometry, we now solve for the template coefficients 
using iteratively reweighted non- negative least squares. 
The coefficients for each galaxy are stored, as well as 
several M-estimators for each grid location, for use in 
the final likelihood analysis (described below). 

4.3. Likelihood Analysis 

Each location in the three-dimensional grid of redshift, 
metallicity, and the termination of star-formation for the 
base 'constant-star-formation' model has a dozen coef- 
ficients, as well as goodness of fit metrics. With these 
grids of population coefficients and M-estimators, confi- 
dence limits on all of the gridded parameters and derived 
properties, such as total masses, stellar masses, restframe 
colors, were derived using a likelihood function 



dC =dzd\ogt end d[Z/H]P(z,\ogt end , [Z/H]): 

exp —Lfjuber 

(z,logt end ,[Z/H])] 



(3) 



where P is the defined set of priors at a given redshift, 
termination time, and metallicity. In our likelihood anal- 
ysis we found that using Huber's M-estimator gave re- 
sults that were most robust to bad pixels without bias- 
ing the contributions of emission lines, though others, 
for ex ample, "least-power" or "Fair" (see, e.g., Zhang 



1997), were also very effective. We caution against us 
ing M-estimators that are insufficiently concave, as pix- 
els "contaminated" by real (but narrow) features such 
as unresolved emission lines are attributed less weight 
than continuum pixels (by definition). With the three 
dimensional likelihood functions, and 24 SFH + 4 emis- 
sion line parameters at each location in the grid, we have 
computed confidence limits for every parameter, and for 
several derived combinations of parameters such as stel- 
lar mass or a broad range of restframe colors. With these 
confidence limits, and a broad array of data quality mea- 
sures, such as S/N ratios, minimum x 2 , X 2 at the peak of 
the likelihood function, and the confidence limits them- 
selves, we can identify the highest quality sample possible 
with the current dataset. 

4.4. Defining the High Quality Sample 

Several criteria are used to eliminate substandard re- 
sults. A number of measures were judged to be helpful, 
including x 2 for the best-fit template at a given S/N 
ratio, the number of pixels not flagged as bad or con- 
taminated, or unphysical inferred rest frame colors. The 
final criterion used to define our high quality sample was 
to restrict the set to those galaxies with formal 95% un- 
certainties in rest frame M g of Ag^M g < 2.0 mag (equiv- 
alent to gm 9 < 0.5 mag). The result is a relatively clean 
sample of 37,000 galaxies, for which we have 44,000 spec- 
tra, over a field of 5.3 degs 2 . For every spectrum, we have 
marginalized over redshift, stellar mass, rest frame col- 
ors, stellar population parameters as per the components 
in the SED fitting, and emission line luminosities, esti- 
mating 68%, 90% and 95% confidence intervals using the 
likelihood functions. 

Example results of the SED fitting illustrating a broad 



range of redshifts and spectral types from our high qual- 
ity sample are shown in Figures [6] and[7| 

Given that our sample is more effectively reaching 
higher redshifts at a given stellar mass than optically 
selected surveys such as DEEP2 and PRIMUS, we now 
turn to characterizing the accuracy of our redshifts. 



4.5. Success Rates and Completeness 

In a spectroscopic galaxy survey, the degree to which 
the population is sampled can be characterized by two 
quantities: the success rate (fraction of targeted galaxies 
for which usable spectra are obtained) and the complete- 
ness (fraction of the original photometric that is observed 
and usable spectra obtained). The latter quantity neces- 
sarily is necessary dependent on the former, and includes 
the effect of slit collisions. In Figures [8] (top) and (bot- 
tom) we plot success rates and completeness as functions 
of 3.6/im, z, and r magnitudes. For galaxies brighter than 
r = 25.5 mag the current pipeline produces a mean suc- 
cess rate of 70% and mean completeness of 38%. In de- 
tail we model the completeness as a 2D function of both 
3.6/im magnitude and i — [3.6] color, and for illustrative 
purposes we have overplotted the mean success rate and 
completeness within broad color bins (though our subse- 
quent analysis employed finer color bins when modeling 
the incompleteness). 

Spectroscopic surveys such as CSI that intend to study 
the density dependence of galaxy properties must care- 
fully model the effect of slit collisions on the complete- 
ness. To accomplish this we divided our sample into 
bins based on local source density. The footprint of a 
CSI spectrum is ~ 200 pixels long, or 2/3 arcmin. For 
every target and observed galaxy, we compute the lo- 
cal source density within 4/9 arcmin 2 . In Figure ^1 we 



show the mean completeness as a function of source aen- 
sity computed on this scale. This monotonic function of 
density is taken to represent the effects of slit collisions. 
We have also computed the completeness functions as 
coarse trivariate functions of source density, [3.6] mag- 
nitude, and i — [3.6] color in and found no statistically 
significant variation in the bivariate ([3.6], i— [3.6]) com- 
pleteness function with source density, other than the 
normalization, as shown in Figure [8j 

Using the trend of the mean completeness as a function 
of source density, and the mapping of the completeness 
as a bivariate function of z-band magnitude and z— [3.6], 
we can directly multiply the two for every galaxy in the 
final sample. In subsequent, and relevant analysis, one 
can employ weights defined as the inverse of these com- 
pleteness estimates. For those galaxies with multiple ob- 
servations, we currently divide these weights by 7V 5 S , the 
number of observations, though more complicated proce- 
dures may be devised at a future date. We have tested 
the validity of these density-dependent completeness cor- 
rections by recomputing the local source densities based 
on the high-quality spectroscopic sample alone. On the 
scales of 2/3 arcmin, the agreement is very good, with a 
scatter of approximately 0.3 dex. On larger scales, of 2 
or 4 arcmin, the agreement is even better with scatters 
of < 0.1 dex at a given density. These corrections, and 
the accuracy with which they can reproduce the origi- 
nal source densities, are crucial for ensuring confidence 
in the galaxy group mass estimates derived below. 
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Figure 6. Example SEDs for blue galaxies, along with their CSI redshifts and 95% confidence limits. The IMACS spectra are shown in 
cyan. The red boxes mark the broadband flux measurements in ugriz JK S [3.6], noting that the SED fitting was performed using ugrizJK s . 
The best-fit stellar population models are shown with thick black lines. When the data require emission line components in the fit, these 
are shown using dark blue. Inset we show the CFHTLS z-band images of the galaxies. In particular, we note the top-left and bottom-left, 
two galaxies for which broadband photometry alone would be insufficient to provide redshift estimates. Low dispersion spectroscopy is key 
to isolating Balmer breaks and emission lines in blue galaxies, leading to redshift uncertainties with very little dependence on spectral type. 
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Figure 7. Same as in Fig. [6] but for red galaxies spanning a range of redshifts in CSI. Note the structure in the SEDs traced by both the 
data and the models outside of the 4000A break. 
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Figure 8. (top) Average success rate as functions of 3.6/xm and i magnitude, where success rate refers to the fraction of slitlets cut that 
led to a high quality redshift measurement, (bottom) The average completeness as functions of 3.6fim and i magnitude, where completeness 
is the fraction of galaxies above the 3.6jitm flux limit within the 5.3 degs 2 area that have a high quality CSI redshift. At a given 3.6/im flux, 
the completeness is a fairly strong function of color, such that it very nearly depends on z-band magnitude alone. We also plot the mean 
completeness as a function of local source density computed on scales of 2/3 arcmin in an effort to account the portion of our incompleteness 
due to slit collisions. 

4.6. Redshift Uncertainties 

We have three different methods at our disposal for 
evaluating the accuracy of CSI redshifts: (1) directly 
comparing our results with several hundred optically- 
selected high-r esolution spectroscopi c redshifts in the 
VVDS sample |Ee Fevre et ah] fl2003| ); (2) deriving un- 
certainties using the large number of object s with re- 
peat mea surements; and (3) employing the Quadri & 
Williams (2010) pairwise velocity approach, in which 
galaxy groups and large scale structures mean that large 
numbers of galaxies lie at the same redshifts, effectively 
providing repeat redshift measurements of cosmic struc- 
tures. All three methods give consistent estimates of 
the redshift uncertainty to r ~ 25 mag and/or z = 1.3. 
At fainter magnitudes and higher redshift there is in- 
sufficient overlap to empirically characterize the redshift 
errors using the VVDS. However, to these limits, these 
checks on our redshift uncertainties all show good agree- 
ment. 



them here, using smaller points in Figure [9j 

For the 191 galaxies with high quality VVDS measure- 
ments, 91% of the redshifts agree to within Sz/(1 + z) < 
0.02, with 96% within Sz/(1 + z) < 0.05. The standard 
deviation for this subsample is cr z /(l + z) = 0.00 8 (us- 
ing a = 1.4 8 x MAD median absolute deviation; [Beers 



4.6.1. Comparison with VVDS 

In our sample of galaxies with high quality redshifts, 
there are 400 with VVDS spectroscopic measurements; 
Figure [9] compares these redshifts. We have visually 
inspected and verified redshifts of the VIMOS spectra 
of these objects common to the VVDS and CSI, in a 
few cases revising VVDS redshifts and quality flags. For 
VVDS objects flagged as 'low quality' (3 & 2), we were 
not able to verify all redshifts; we nevertheless include 



et al.||1990 ). For the 138 with poor quality flags in the 
VVDS subsample the scatter is cr z /(l + z) = 0.014, with 
86% agreeing to within 5z/(l + z) < 0.05. In Figure [9] 
we show the la scaters in different magnitude bins also 
derived using a = 1.48 x MAD. Note that these values 
include a number of galaxies with low quality VVDS red- 
shifts. For galaxies down to the PRIMUS selection limit 
of i = 23 mag, the scatter is cr z /(l + z) = 0.009. Fainter 
than i = 23 mag, the scatter rises to <j z / (1 + z) = 0.025. 
For galaxies selected by CSI to be between 1 < z < 1.5, 
we find cr z /(l + z) = 0.033, though the distribution is 
non- Gaussian and the subsample largely contains objects 
with low quality VVDS redshifts. 

The comparisons between our redshifts and those made 
available by the VVDS have allowed us to confirm that 
our procedures are working well. However, a larger num- 
bers of galaxies is required to accurately characterize red- 
shift uncertainties as a function of magnitude, color, or 
spectral type. Fortunately, we have two additional tests 
to help us probe the accuracy of our redshifts. 

4.6.2. Uncertainties Derived from Repeat Measurements 

The data from individual slit masks were reduced inde- 
pendently, and so repeat measurements can be compared. 
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Figure 9. (top left) CSI redshifts plotted against VVDS redshifts with quality flags 3,4,5. Larger symbols indicate higher quality VVDS 
redshift flags. In orange and blue we show those galaxies observed with the LDP and UDP respectively, (top right) Fraction redshift error 
vs. VVDS redshift. Numbe rs shown with b ins along the x-axis show the la estimate of the scatter, computed using a = 1.48 x MAD 
(median absolute deviation; |Beers et al.|1990| . (bottom left) and (bottom right) Fractional redshift error vs. i-band, and 3.6/im magnitude. 
For galaxies with high quality VVDS redshirt flags, we find a la scatter of 1% in Sz/(1 + z), and that for galaxies between 0.7 < z < 1.0, 
94% have fractional redshift errors Sz/(1 + z) < 5%. 



These comprise approximately 20% of the measured red- 
shifts, providing a sample large enough to be analyzed 
as a function of galaxy properties, such as magnitude, 
color, stellar mass, or redshift. Without prior constraints 
or measurements for the derived properties of these ob- 
jects we adopt the mean of multiple measurements when 
plotting or analyzing the distributions as functions of 
redshift. In Figure 10 we plot the 68% fractional redshift 
differences against redshift, as well as 3.6/im and i-band 
magnitudes. As a function of redshift the half-width of 
the 68% confidence is 0.9% at 0.6 < z < 0.9, 2.2% for 
0.9 < z < 1.2. As can be seen in the third panel, the 
width of the distribution is a stronger function of opti- 
cal magnitude than near-IR magnitude because our SED 
fitting is dominated by the IMACS spectroscopy. Cur- 
rently, we find a = 3% at 22.5 mag < i < 24 mag, and 
a = 5% at 24 mag < i < 25.2 mag. Planned refinements 
in the pipeline are expected to improve the fidelity of 
the extracted spectra for such faint sources as the CSI 
Survey progresses. 
The redshift uncertainties are dissected further in Fig- 



ure [TT] by breaking up the sample into three redshift 
ranges and plotting against S/N ratios of the IMACS 
spectra, stellar mass, star formation activity, and rest 
frame u — g color. From these panels one can see that 
the redshift errors are not strongly correlated with spec- 
tral type or color, unlike the typical dependencies seen 
in photometric redshifts. There is a rather small, subtle 
dependence on spectral type: quiescent galaxies (those 
with a negligible specific star formation rate and hence 
strong Balmer/4000A breaks) and strongly starforming 
galaxies have the greatest precision, while galaxies with 
modest or intermediate SFR have degraded redshift pre- 
cision. Weak [Oil] 3727 A emission at high redshift and 
observed at low resolution can appear blended with the 
4000A break break and produce spectra that are rea- 
sonably well fit by an SED with a Balmer break and 
no [OIIJ3727A. More specifically, [OII]3727A is imper- 
fectly correlated with the SFR inferred from the stellar 
continuum, leading to an additional uncertainty in the 
redshifts at the ~ 1 — 2% level, depending on the S/N 
ratio of the spectra, and even when not particularly sta- 
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Figure 10. Fractional redshift differences between repeat measurements, scaled by y/2, as functions of redshift, 3.6/mi magnitude, and 
i-band magnitude. For example, at 0.6 < z < 0.9, the half-width of the 68% confidence interval is a = 0.9%. At 0.9 < z < 1.2 we 
find a = 2.3%. As can be seen in the third panel, the width of the distribution is a stronger function of optical magnitude than near-IR 
magnitude, because our SED fitting is dominated by the IMACS spectroscopy. We find a = 2.6% at 22.5 mag < i < 24 mag, and a = 5% 
at 23.5 mag < i < 24.8 mag. 

S/N signal from the pairwise velocity histograms. While 
photometric redshift surveys, by definition, result in the 
highest possible spatial sampling, the spatial sampling of 
CSI is limited by slit collisions. This limits the number 
of galaxy pairs available to perform this analysis, com- 
pared to what would be obtained in a purely photometric 
redshift survey. 

We divided our sample into subsets based on r-band 
magnitude and redshift and constructed pairwise veloc- 
ity histograms. For the subsamples based on magnitude, 
we used those galaxies brighter than a given bin as a 
reference set, as suming those galaxies have more precise 
redshifts. While Quadri & Williams (2010) chose to ran- 
domize the positions in their dataset to correct for close 
pairs that arise only in projection, we opted to randomize 
the galaxy velocities since our slit-placement constraints 
lead to nontrivial spatial dependencies. Care was taken 
to properly normalize the histograms derived from the 
randomized distributions, as the number of galaxies in 
them, by definition, is too large by the number of galaxies 
within coherent velocity structures. Because this number 
cannot be precisely defined a priori, it must be deduced 
from the data — from the wings of the histograms. We 
made several realizations to reduce the Poisson noise im- 
posed by the randomization process. 
The results of these experiments are shown in Figure 



tistically significant. This effect is substantially less pro- 
nounced for UDP data, where the higher resolution of 
the disperser shifts the wavelength of such [OII]3727A 
confusion farther to the red. This power of the higher 
resolution UDP is illustrated in the last SED of Figure 
|6j in an old galaxy at z = 1 .27 with [Oil] 3 72 7 A l ine emis- 
sion (likely a LINER, e.g. Lemaux et al. 2010[ ). Again, 
this problem with low resolution spectroscopy is only im- 
portant for "green" galaxies; the bulk of the galaxy pop- 
ulation, lying in the blue cloud and red sequence (with 
strong [Oil] 3727 A and Balmer breaks respectively), is 
unaffected. 

While the abundance of repeat spectroscopic measure- 
ments has allowed us to characterize the redshift uncer- 
tainties, repeated spectra of any given object are fine- 
tuned wit h the same broadband photometry as described 
in Section 4.2 For some galaxies this can in principle in- 
troduce artificially tight correlations between the repeat 
spectroscopic measurements. Presumably, this issue be- 
comes a serious problem only if the broad-band photom- 
etry is allowed to have greater weight in the likelihood 
analysis. This never occurs in our procedures, in par- 
ticular because the number of data points in the UDP 
and LDP spectra far exceed the number of photomet- 
ric bands. Even when the S/N ratios of the spectra are 
low, binning these data over the photometric bandpasses 
yield S/N ratios that still exceed what obtains from the 
photometry for those fainter galaxies. Even with these 
reasons to trust the analysis of the repeat measurements, 
we employ yet another means to assess the redshift errors 
in the next section. 

4.6.3. Uncertainties from Pairwise Velocity Distributions 

Because galaxies largely exist in close pairs, in groups, 
in clusters, and in even larger coherent structures, all 
of which have velocity widths smaller than the redshift 
errors we expect in our data and analysis, the distribu- 
tion of redshift errors can be inferred statistically from 
the distribution of pairwise redshift differences in a given 
sa mple. This technique was fi rst explored and described 
by |Quadri fc Williams] ( |2010[ ) in an effort to derive the 
redshift uncertainties m photometric redshift surveys, 
where a lack of repeat measurements and limited tem- 
plates can hamper one's ability to estimate uncertain- 
ties. Any methodology relying explicitly on clustering 
requires dense spatial sampling in order to extract a high 



12 as the red histogram. Overplotted are the la confi- 
dence limits on our redshifts, shown in blue, and com- 
puted by dividing the 95% confidence limits by four. The 
green filled circles are the expected standard deviations 
of the pairwise velocity histograms for galaxie s with red- 
shift unce rtainties shown in blue. Overall the |Quadri fc| 
Williams (2010) method has reinforced the validity of the 



redshift uncertainties derived from our generalized like- 
lihood analysis down to faint magnitudes, and from our 
comparisons to the relatively bright VVDS sample. 
This result shou l d be of general interest, since the 



Quadri fc Williams (2010) method was in part devised to 
remedy the deficiency that photometric redshift surveys 
do not provide repeat measurements of a given galaxy. 
However, any spatially dense survey will provide repeat 
meas urements of (relatively) c old cosmic structures, so 
the Quadri & Williams (2010) methodology remains a 
valuable technique tor any spatially dense spectroscopic 
or photometric redshift survey — any survey where one 
does not have repeat observations with which to charac- 
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Figure 11. Fractional redshift differences between repeat measurements, scaled by v^2, as functions of S/N ratio (75th percentile of the 
prism spectra), stellar mass, relative star formation activity, and restframe u — g color. There is very little dependence of the redshift errors 
on spectral type or color, unlike what is typically seen for photometric redshift s. 

terize uncertainties empirically. 

Based on these three methods, we conclude that our 
random redshift uncertainties have been reliably char- 
acterized, and also that systematic uncertainties appear 
to be minimal. In particular we stress that our redshift 
errors are not strongly dependent on relative star for- 
m ation activity, spect ral type, or color. As discussed 
by Quadri et al. (2012), redshift errors that are strongly 
correlated with spectral type will bias estimates of local 
galaxy density if care is not taken to ensure that veloc- 
ity windows or velocity linking lengths can encompass 
both red and blue galaxies with equal probability. Given 
CSI's primary goal of characterizing galaxy properties as 
a function of environment, the relative insensitivity of 
our redshift errors on spectral type is a crucial point. 




5. DISCUSSION 

We conclude our discussion of the survey by presenting 
a broad overview of the CSI sample, by tracing basic 
properties of these Spitzer-selected galaxies over time. 
We specifically highlight the selection limits in order to 
give the reader a better sense of the depth of the sample 
in redshift, luminosity, and stellar mass. 

Figure 13 shows the distribution of (top) rest frame 
5-band magnitude and (bottom) stellar mass as func- 



22 24 
r (*na^} 

Figure 12. A comparison of the redshift uncertainties derived 
from the likelihood functions with the mean uncertainty as a func- 
tion of m agnitude derived from the pairwise velocity histogram 
method of Qua dri fc Williams] j2010| ). In blue we plot la redshift 
uncertainties tor the individual galaxies. In green we plot the ex- 
pected standard deviation of the pairwise velocity histograms from 
the blue points. The red shaded bars trace the measured stan- 
dard deviations from the pairwise velocity histograms as functions 
of magnitude. The comparison is quite good, suggesting that the 
uncertainties from our redshift fitting procedures are robust. 
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Figure 13. The distribution of our 3.6/im-selected galaxies in 
(top) restframe £>-band magnitude and (bottom) stellar mass, as 
functions of redshift, where we have normalized the distribution 
at each redshift by the number counts in each redshift bin. The 
green and blue lines indicate the approximate loci for old stellar 
populations at the limits of the PRIMUS and DEEP2 surveys. In 
orange, we show the CSI selection limit of 3.6/im = 21 mag (solid) 
and the effective optical limit of i = 24.7 mag (dashed). The 
maroon contours trace the distribution of "quiescent" galaxies, as 
selected by their restframe (U — V) and (V — J) colors jWilliams et| 
|al.|2009| >. Selection in the near-IR has ensured that CdI does not 
a priori exclude the bulk of passively evolving systems at z > 1. 
Meanwhile, the observing strategy, reductions, and first analysis of 
the data enable CSI to reach more than a magnitude fainter in the 
optical than DEEP2 or PRIMUS. 



tions of redshift. The dark red contours specifically out 
line the distribution of quiescent galaxies, a s selected by 
their restframe U — V and V — J colors (Williams et 
The CSI 3. 6 /am flux limit is shown by the 



all [2009). 

solid orange lines, along with our effective optical limit 
of i ~ 24.7 mag as dashed orange lines. This i-band limit, 
where our spectroscopic completeness falls below 1/3 of 
the maximum (of 50% at i = 21 mag, see Fig. [8|, domi- 
nates over the IRAC selection at z > 1.2. In these plots, 
the sample becomes irreparably incomplete for galaxies 
at masses or magnitudes below the lines. These figures 
show that the goal of producing a large, cosmologically 
representative set of galaxies back to z ~ 1.5 is being 
achieved. CSI is probing galaxies with the present day 
stellar mass of the Milky Way to a time when the universe 
was 9 Gyr younger than today and without explicitly ex- 
cluding galaxies with low M/L ratios. 
For comparison, we plot the DEEP2 and PRIMUS 





Figure 14. (top) Distributions of high quality redshifts for galax- 
ies more massive than half that of the Milky Way. The redshift 
bins are defined by a constant comoving volume in which cosmic 
variance is ~ 20% for red galaxies (dark gray boxes; |Somerville et| 
|al.||2004| ). The apparent decline in N(z) above z = U.Y is due to 
an increase in spectroscopic incompleteness (i.e. a declining suc- 
cess rate, see the bottom panel), (bottom) Applying completeness 
corrections, we convert N(z) to the comoving number density of 
galaxies as a function of redshift. We plot the mean completeness 
as a function of redshift in blue. The light gray shaded region be- 
yond z = 1.2 in the right-hand panel shows approximately where 
the faint optical magnitude limit of the current analysis cuts the 
sample on the red sequence (see Figure^). 

magnitude limits of r = 24.1 mag and i = 23.0 mag us- 
ing the blue and green lines, respectively. These surveys 
can probe unbiased samples down to the stellar mass of 
the Milky Way back to z = 0.9, and half the mass of 
the Milky Way to z = 0.75. CSI's additional reach to 
z = 1 for galaxies with half the mass of the Milky Way 
may only reach 1 Gyr farther back in cosmic time, but 
the comoving volume covered within a constant area on 
the sky is nearly double that of PRIMUS. Such gains in 
sensitivity highlight the importance of an infrared selec- 
tion in crafting a minimally unbiased picture of galaxies 
since z = 1.5. 

(top) shows the distribution of high qual- 



Figure 



14 



ity redshirts for galaxies with stellar masses M > M*/2, 
using redshift bins estimated to have co nstant cosmic 
variance of ~ 20% ( jSomerville et al.|2004| shown in dark 
gray). A large fraction of the decline in TV (z) is due to our 
declining success rates and completeness at fainter opti- 
cal magnitudes (as shown by the blue lines in the bottom 
panel). As can be seen Figure 13, however, selection at 



brighter optical limits, however, would have dramatically 
exacerbated the incompleteness. Cutting at i = 23 mag 
(the PRIMUS magnitude limit) would have eliminated 
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~ 50% of those galaxies at z = 1 in CSI. Selection at 
those wavelengths that more closely trace stellar mass, 
and a methodology for measuring redshifts that does not 
depend crucially on the presence of emission lines at faint 
magnitudes, ensures, at least a priori, that the resulting 
samples are free from bias against galaxies with low M/L 
ratios at fixed mass. 

We have applied completeness corrections and con- 
verted our N(z) figures to estimates of the comoving 
number densities of galaxies as functions of redshift in 
this stellar mass range and plot them in the bottom pan- 
els. The dark gray, again, shows the estimated cosmic 
variance of ~ 20%, which dominates the uncertainties 
(y/N /N is small). The light gray shaded region shown at 
high redshift in the bottom right indicate redshifts where 
our faint magnitude limits impinge on the low mass end 
of this subsample and the selection biases become impor- 
tant. The mean of our completeness estimates as func- 
tions of redshift are plotted in blue. 

Detailed analysis of the evolution in number densities 
of galaxies over cosmic time is beyond the scope of this 
first paper, which has primarily focused on the motiva- 
tion behind the survey and discussion of the handling of 
the data. However, Figure [14] reinforces what previous 
deep surveys covering smaller fields-of-view have found: 
fairly modest evolution in the number densities of nor- 
m al galaxies since z = 1 (|Drory et al.|2009{|Marchesini et 
[aT||2009| |Ilbert et al.||2010[ e.g.). In subsequent publica- 
tions from the Carnegie-Spitzer-IMACS Survey, we will 
begin dissecting the dataset in order to identify galaxy 
groups and trace the e volution of the mass function of 
such groups since z = 1 ( |Williams et al.|2012| ) , and begin 
characterizing the star formation activity and histories of 
galaxies in groups of different mass scales as a function 
of time. 

6. SUMMARY 

We have described the methodology and initial data 
of an ambitious new survey of approximately 2 x 10 5 
galaxies over a volume that encompasses the last 9 Gyr 
of cosmic history. By selecting a sample at 3.6/im from 
Spitzer-IRAC, the CSI survey is the most uniform to- 
date in terms of limiting mass as a function of redshift, 
and will ultimately cover an unbiased volume compara- 
ble to SDSS but at 0.5 < z < 1.5. The power of the 
IMACS spectrograph with low-resolution prisms allows 
us to survey large volumes efficiently and with sufficient 
spectral resolution to detect large-scale structure and to 
measure emission lines from strongly star-forming galax- 
ies and AGN. CSI provides comparable redshift accura- 
cies for red sequence and blue cloud galaxies; a significant 
advantage compared to many broadband photometric — 
and even some spectroscopic — studies. By combining our 
low-resolution spectrophotometry with extended broad- 
band photometry and sophisticated SED modeling, CSI 
bridges the gap between surveys that are deep enough 
to probe galaxies below M* but are small in sample size 
and volume, and those large-sample, large-volume sur- 
veys that do not reach typical, Milky Way-like galaxies 
that are the main event in the history of cosmic evolu- 
tion. In forthcoming papers we will update with more 
detail what we have learned from the first batch of CSI 
survey data. 

Once the three survey fields of CSI have been com- 



pleted at the end of 2012, we will have probed a volume 
equivalent to the SDSS, enabling a vast number of studies 
of distant galaxies. One of the highest priority goals will 
be to take advantage of this volume to simultaneously 
measure the evolutio n of the group mass function (e.g. 
Williams et al.|2012[ ), and the dependence of star forma- 
tion on environment (e.g. Kel son et al.||2012 ). By doing 
so CSI will uncover the extent to which the growth in the 
number density of passive galaxies over the past 2/3 of 
the lifetime of the universe can be attributed to group- 
related processes such as, for example, galaxy-galaxy in- 
teractions (leading to differential number density evo- 
lution between groups and galaxies), and/or a decrease 
in the available fuel for on- going star formation (leaving 
galaxy number densities unchanged). Given that most 
alaxies in groups a re "red and dead" even at z ~ 1 
Kelson et al.|[2012| ), the mass selection, high spectral 
coverage, and color-insensitive redshift accuracy of CSI 
will all be critical elements in characterizing relatively 
poor groups at high redshift. Subsequent CSI analyses 
will also focus on, for example, AGN and any connections 
to group and/or host galaxy prop erties, on differences 
between centrals and satellites (e.g. i Berlind fc We inbergl 
|2002| |Weinmann et alj|200(^van derTB osch et al. 20U7|7 
and on merger rates (e.g. | Williams et al.||201l| ) as lunc- 
tions of environment. 
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